Robust Bayesian cluster enumeration based on the t distribution

نویسندگان

چکیده

A major challenge in cluster analysis is that the number of data clusters mostly unknown and it must be estimated prior to clustering observed data. In real-world applications, often subject heavy tailed noise outliers which obscure true underlying structure Consequently, estimating becomes challenging. To this end, we derive a robust enumeration criterion by formulating problem as maximization posterior probability multivariate tν distributed candidate models. We utilize Bayes’ theorem asymptotic approximations come up with possesses closed-form expression. Further, refine derivation provide for sets finite sample size. The criteria require an estimate parameters each model input. Hence, propose two-step algorithm uses expectation partition calculation one criteria. performance proposed tested compared existing methods using numerical real experiments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Bayesian Filtering and Smoothing Using Student's t Distribution

State estimation in heavy-tailed process and measurement noise is an important challenge that must be addressed in, e.g., tracking scenarios with agile targets and outlier-corrupted measurements. The performance of the Kalman filter (KF) can deteriorate in such applications because of the close relation to the Gaussian distribution. Therefore, this paper describes the use of Student’s t distrib...

متن کامل

An Extension of the Birnbaum-Saunders Distribution Based on Skew-Normal t Distribution

In this paper, we introducte a family of univariate Birnbaum-Saunders distributions arising from the skew-normal-t  distribution. We obtain several properties of this distribution such as its moments, the maximum likelihood estimation procedure via an EM-algorithm and a method to evaluate standard errors using the EM-algorithm. Finally, we apply these methods to a real data set to demonstr...

متن کامل

A Novel Bayesian Cluster Enumeration Criterion for Unsupervised Learning

The Bayesian Information Criterion (BIC) has been widely used for estimating the number of data clusters in an observed data set for decades. The original derivation, referred to as classic BIC, does not include information about the specific model selection problem at hand, which renders it generic. However, very little effort has been made to check its appropriateness for cluster analysis. In...

متن کامل

Robust mixture regression using the t-distribution

The traditional estimation of mixture regression models is based on the normal assumption of component errors and thus is sensitive to outliers or heavy-tailed errors. A robust mixture regression model based on the t−distribution by extending the mixture of t−distributions to the regression setting is proposed. However, this proposed new mixture regression model is still not robust to high leve...

متن کامل

Robust mixture modelling using the t distribution

Normal mixture models are being increasingly used to model the distributions of a wide variety of random phenomena and to cluster sets of continuous multivariate data. However, for a set of data containing a group or groups of observations with longer than normal tails or atypical observations, the use of normal components may unduly affect the fit of the mixture model. In this paper, we consid...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Signal Processing

سال: 2021

ISSN: ['0165-1684', '1872-7557']

DOI: https://doi.org/10.1016/j.sigpro.2020.107870